The Logic of Exploratory and Confirmatory Data Analysis

Author

  • Elissaios Karageorgiou

Abstract

Exploratory Data Analysis (EDA) and Confirmatory Data Analysis (CDA) are two statistical approaches widely used in scientific research. They are typically applied in sequence: first, EDA helps form a model or hypothesis to be tested, and then CDA provides the tools to confirm whether that model or hypothesis holds. When both analyses are applied within a single experiment, two main types of error can occur, both of which fall under the general term of selection bias. The first is the biased selection of the data set used to confirm the model derived by EDA. The second occurs when CDA becomes part of EDA instead of being applied after EDA is complete. As a result of selection bias, a model can be overfitted so that it holds only narrowly, i.e. for the specific sample from which it was derived, without any generalizability. This bias in planning the analysis occurs frequently in the literature. This paper provides the theoretical background and the conceptual tools with which to identify such errors in the literature and to carry out the analysis properly. Applications of EDA and CDA in medical biomarker research are used as paradigms to clarify the concepts.

“έστιν ουν επιστήμη δόξα αληθής μετά λόγου” (Science is affirmed knowledge through logical arguments.)
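The selection-bias argument in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: the function names (`pearson`, `simulate`), the sample sizes and the use of pure-noise "biomarkers" are all assumptions chosen for illustration. It picks the marker that correlates best with an outcome in an exploratory half of the sample, then measures the same marker both on that half (CDA folded into EDA) and on a held-out half (CDA applied after EDA).

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def simulate(n_subjects=200, n_markers=50, seed=0):
    """Return (selected marker index, r on exploratory half, r on held-out half).

    All markers are pure noise (an assumption of this sketch), so any
    correlation with the outcome is a chance finding.
    """
    rng = random.Random(seed)
    outcome = [rng.gauss(0, 1) for _ in range(n_subjects)]
    markers = [[rng.gauss(0, 1) for _ in range(n_subjects)]
               for _ in range(n_markers)]

    half = n_subjects // 2
    explore = slice(0, half)           # sample used for EDA
    confirm = slice(half, n_subjects)  # sample reserved for CDA

    # EDA step: choose the marker with the strongest exploratory correlation.
    best = max(
        range(n_markers),
        key=lambda j: abs(pearson(markers[j][explore], outcome[explore])),
    )

    # Biased "confirmation": re-using the sample that produced the hypothesis.
    r_explore = pearson(markers[best][explore], outcome[explore])
    # Independent confirmation: the held-out sample played no role in selection.
    r_confirm = pearson(markers[best][confirm], outcome[confirm])
    return best, r_explore, r_confirm


if __name__ == "__main__":
    best, r_explore, r_confirm = simulate()
    print(f"selected marker: {best}")
    print(f"r on exploratory sample: {r_explore:+.3f}")
    print(f"r on held-out sample:    {r_confirm:+.3f}")
```

Averaged over many seeds, the absolute correlation on the exploratory half tends to be clearly larger than on the held-out half, even though no marker is truly related to the outcome. This is the overfitting pattern the abstract describes: the selected model holds only for the sample from which it was derived.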


Similar resources

Factor structure of the revised Persian version of the NEO Personality Inventory in Iran

Objectives: In view of the methodological drawbacks of previous investigations of the NEO Personality Inventory-Revised, the present study aimed to reinvestigate the construct validity of this inventory in Iran. Method: 334 students (231 female and 98 male, 5 not specified) with a mean age of 23.1 were selected via convenience sampling from Tehran, Elm-o-Farhang and Shahid Beh...


Validity & reliability of the Persian version of the Grasha-Riechmann student learning styles scale

Introduction: The present study aimed to investigate the psychometric properties of the Grasha-Riechmann Student Learning Styles Scale. Method: The participants included 1039 students (421 in human sciences and 618 in technical sciences), selected through stratified sampling from Tehran University. They answered the Grasha-Riechmann Student Learning Styles Scale and the data was an...


Determining the validity and reliability of a crisis management assessment questionnaire based on the seven principles of resilience engineering in hospitals

Background and aims: Because the assessment of crisis management plays an important role in planning training and in increasing hospitals' awareness and preparedness, the new Resilience Engineering approach can help increase the efficiency of crisis management and empower hospitals to face crises. The aim of this study was to evaluate the validity and reliability of the q...


Developing a Critical Checklist for Textbook Evaluation

This study has been carried out to develop a critical checklist for global/commercial textbooks which play a crucial role in language teaching/learning. For this aim, a number of items have been developed based on a comprehensive review of the related literature and experts’ opinions. The tentative checklist was administered to the targeted population, yet 326 checklists were deemed appropriate...


Determining the validity and reliability of the Tehran city bus drivers' behavior questionnaire in 1391: exploratory and confirmatory factor analysis

Background and Objective: Most accidents can be directly attributed to human factors. The aim of this study is to determine the validity and reliability of a driver aberrant behavior questionnaire in an urban bus company. Materials and Methods: This descriptive study was performed on 168 subjects (for exploratory factor analysis) and 161 subjects (for confirmatory f...




Publication date: 2011